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Abstract — We show that several important resource allocation 
problems in wireless networks fit within the common framework 
of Constraint Satisfaction Problems (CSPs). Inspired by the 
requirements of these applications, where variables are located 
at distinct network devices that may not be able to communicate 
but may interfere, we define natural criteria that a CSP solver 
must possess in order to be practical. We term these algorithms 
decentralized CSP solvers. The best known CSP solvers were 
designed for centralized problems and do not meet these criteria. 
We introduce a stochastic decentralized CSP solver, proving that 
it will And a solution in almost surely finite time, should one exist, 
and also showing it has many practically desirable properties. We 
benchmark the algorithm's performance on a well-studied class 
of CSPs, random k-SAT, illustrating that the time the algorithm 
takes to find a satisfying assignment is competitive with stochastic 
centralized solvers on problems with order a thousand variables 
despite its decentralized nature. We demonstrate the solver's 
practical utility for the problems that motivated its introduction 
by using it to find a non-interfering channel allocation for a 
network formed from data from downtown Manhattan. 



I. Introduction 

A Constraint Satisfaction Problem (CSP) consists of N 
variables, x := (x\, . . . , Xji), and M clauses, i.e. {0,1}- 
valued functions, ($ 1 (a ? ), . . . , An assignment x is 

a solution if all clauses simultaneously evaluate to 1. In 
the context of wireless networks, we show that CSPs pro- 
vide a unifying framework that encompasses many important 
resource allocation tasks. Examples include: allocation of 
radio channels so that transmissions by neighbouring WLANs 
(Wireless Local Area Networks) or mobile phone cells do not 
interfere; the selection of packets to be XORed on each link 
in network coding; and finding a non-colliding schedule of 
time-slots in a WLAN. Unlike in traditional CSPs, however, in 
network applications each constrained variable Xi is typically 
co-located with a physically distinct device such as an access 
point, base-station or a link. The resulting communication 
constraints impose severe restrictions on the nature of the algo- 
rithm that can be used for solving the CSP. These restrictions 
are violated by existing CSP solvers, leading us to define a new 
class of algorithms that we term decentralized CSP solvers. We 
constructively establish the existence of decentralized solvers 
by introducing a family of randomized algorithms belonging 
to this class. 

Roughly speaking, decentralized CSP solvers must be capa- 
ble of finding a satisfying assignment, x, while updating each 
variable Xi based solely on knowledge of whether all of the 
clauses in which x t participates are satisfied or at least one 
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clause is unsatisfied. The following example gives a concrete 
illustration of the origins of these constraints. 

A. Motivating example: channel allocation 

802.11 WLANs are ubiquitous, but their deployments are 
unstructured and they operate in an unlicensed frequency band. 
Within this band, an 802.11 WLAN can select one of several 
channels, typically 11, to operate on. In the vast majority 
of WiFi routers channel selection is, at present, based on 
operator selection without any quantitative instruction from the 
router. Self-selection of this channel is the task we consider 
here. Co-ordinated selection is hampered by the fact that the 
interference range of a typical 802.11 device is substantially 
larger than its communication range. Consequently, WLANs 
can interfere but may be unable to decode each others mes- 
sages (illustrated schematically in Fig. [TJ. Discovery by a 
WLAN of the existence of interfering WLANs via its wired 
back-haul may be prevented by firewalls and, in any case, 
these WLANs may not know their physical location. These 
practical restrictions mandate a decentralized channel-selection 
algorithm. 

We can identify the task of each WLAN self-selecting 
a non-interfering channel with a CSP with communication 
constraints imposed by the network topology. Let xi be the 
channel selected by WLAN i € {1,...,N} and define 
M — N(N — l)/2 clauses, one for each pair of WLANs, 
that evaluates to one if the WLANs are on non-interfering 
channels or are out of interference range and is zero otherwise. 
Due to the unstructured nature of the deployment, the lack of 
shared administrative control and communication constraints, 
a WLAN cannot rely on knowing the number or the identity of 
interfering WLANs or the channels that they currently have 
selected. A practical CSP solver for this task can only rely 
on each WLAN being able to measure whether: (i) all of the 
neighbouring WLANs have selected a different channel from 
it; or (ii) if one or more neighbours have selected the same 
channel. 

In certain settings, a limited amount of communication 
of variable values between the stations hosting them may 
be feasible. For example, it may be possible for stations 
hosting a link or for a WLAN to overhear beacons or traffic 
from a subset of its interferers, or for an access point to 
communicate with other, nearby access points. Algorithms for 
channel selection that are proposed in the literature assume 
the existence of end-to-end communication for centralized 
solutions, for message passing in decentralized or gossiping 
solutions, or to co-ordinate global restart in simulated anneal- 
ing proposals e.g. (TJ, Q, 0, ||4l and references therein. This 
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[TV] we consider its speed at identifying solutions of instances 
of random k-SAT. In Section[V]we demonstrate the algorithm's 
utility in solving the problems that motivated its introduction. 
Section VI contains discussion and closing remarks. 



Fig. 1. An illustration of overlapping interference regions in neighbouring 
WLANs. Black dot indicates location of a wireless access point, shaded area 
indicates region within which communication with access point can take place 
and outer circles indicate regions within which transmissions interfere. 

information, however, tends to be opportunistic in nature and 
cannot be relied on a priori. We leave how best to exploit 
such partial information to future work, focusing here on the 
most challenging, fundamental cases where fully decentralized 
algorithms are required. 

B. Contribution 

The primary contributions of this paper are fourfold. Firstly, 
we show that CSPs provide a unifying framework that en- 
compasses several important problems in wireless networks, 
including channel selection, inter-session network coding and 
decentralized scheduling in a WLAN. Each of these problems 
is generally thought of as being distinct and requiring different 
solution approaches. We show that they are fundamentally re- 
lated as CSPs so that the decentralized CSP solver introduced 
in this paper can be used for all of them. Secondly, we define a 
new class of CSP algorithms which we term decentralized CSP 
solvers and, extending our earlier approach to graph colouring 
[5||6|, introduce a novel stochastic decentralized CSP solver 
proving that it will find a solution in almost surely finite 
time, should one exist, and showing it has many practically 
desirable properties. Thirdly, we benchmark the algorithm's 
performance on a well-studied class of application-agnostic 
CSPs, random k-SAT. For problems with up to a thousand 
variables, which are large for our motivating examples, we 
find that the time the algorithm takes to find a satisfying 
assignment is close to that of WalkSAT Q, a well regarded, 
efficient centralized CSP solver, while also possessing de- 
sirable features of Survey Propagation (Sj. That is, despite 
its decentralized nature, the algorithm is fast. Fourthly, we 
demonstrate the solver's practical utility in a complex, wireless 
resource allocation case study. 

The rest of the paper is organized as follows. In Section 
HIl we define decentralized CSP solvers and show that well- 
known algorithms for solving CSPs fail to meet these criteria. 
In Section [Til] we introduce a decentralized CSP solver. We 
prove that it finds a solution in finite time with probability one 
whenever a feasible solution exists, obtaining an upper bound 
on the algorithm's convergence rate in the process. In Section 



II. Decentralized CSP solvers 

We begin by formalizing the criteria that CSP solvers 
must possess to be of use for problems with communication 
constraints such as those outlined above. We call algorithms 
that meet these criteria decentralized CSP solvers. 

Definition 1 (CSP): A CSP with N variables, 
{%%, . . . , Xjsr}, and M clauses is defined as follows. The 
variables each take values in a finite set T> = {1, ... , D} and 
x := (xi, . . . , Xpj) £ T> . Each clause m £ M. = {1, . . . , M} 
is defined by a function <& m : T> N H> {0, 1} where for an 
assignment of variables, x, $> m (x) — 1 if clause m is satisfied 
and $ m (x) = if clause m is not satisfied. An assignment 
x is a solution to the CSP if all clauses are simultaneously 
satisfied. That is, 

x is a satisfying assignment iff min Q m (x) = 1. (1) 

This encompasses all of our examples of interest as well as 
k-SAT, the most well-studied general class of CSPs. 

A CSP solver is an algorithm that can find a satisfying 
variable assignment for any solvable CSP. 

Definition 2 (CSP Solver): A CSP solver takes a CSP as 
input and determines a sequence {x(t)} such that for any CSP 
that has satisfying assignments: 

(Dl) for all t sufficiently large x(t) — x for some 

satisfying assignment x; 
(D2) if t' is the first entry in the sequence {x(t)} such that 
x(t') is a satisfying assignment, then x(t) = x(t') for 
all t > f. 

To provide criteria that classify CSP solvers as being 
decentralized, we begin with the following definition. 

Definition 3 (Clause participation): We say that variable 
Xi participates in clause $ m (x) if the value of Xi can influence 
the clause's satisfaction for at least one assignment of the rest 
of the variables in x. 

For each variable let M.i denote the set of clauses in 
which it participates: 

Mi= |J ((J |J {m:$ m (x i ,{x j }) = 0} 

Xi£{l,...,d} Xj£{l,...,d} 

P| U U {m:$ m (xi,{x s }) = l}\. 

i I ■■'! / 

Re-writing the left hand side of eq. ([T} in a way that focuses 
on the satisfaction of each variable we have: 

x is a satisfying assignment iff min min $> m (x) = 1. (2) 

A decentralized CSP algorithm can be thought of as having 
intelligence co-located with each variable. The intelligence 
at variable Xi can only determine whether all of the clauses 
that Xi participates in are satisfied or that at least one clause 



3 



is unsatisfied and must make update decisions locally based 
solely on this knowledge. 

Definition 4 (Decentralized CSP Solver): A decentralized 
CSP solver is a CSP solver that for each variable Xj, i G 
{1, ...,N}, must select its next value based only on an 
evaluation of 



min <£>, 



(3) 



(D4) 
(D5) 



That is, for each variable Xi the solver must make a decision 
on the next value of x, based solely on knowing whether 
that variable is currently satisfied or not without explicitly 
knowing: 

(D3) the assignments of other variables, Xj for any j ^ i; 
the set of clauses that any variable, including itself, 
participates in, Mj for j £ {1, . . . , N}; 
the functions that define clauses, <£> m for m G 
{1,...,M}. 

Note that the properties (Dl) (D2) of any CSP solver mean 
that a decentralized CSP solver must, without explicit com- 
munication, settle upon a satisfying assignment the first time 
one is found. Communication to co-ordinate global stopping 
or restarting of the solver would be contrary to the nature of 
the natural constraints of these problems and so is forbidden. 
The stochastic algorithm we define in Section III shall 



provably satisfy the properties of a decentralized CSP solver 
with probability one. Moreover, it will also have the desirable 
property that it will automatically restart its search, without 
communication, upon any change to the clauses of a CSP that 
makes the current variable assignment no longer satisfying. 
This is significant in practical applications as, for example, 
the arrival of new transmitters in a wireless network would 
induce a change the associated CSP. 

A. Formulating Wireless Network Tasks As CSPs 

Before proceeding, we demonstrate that several important 
resource allocation tasks in wireless networks fall within 
this CSP framework, which, therefore, provides a unifying 
framework for analyzing these tasks. 



1) Graph colouring. As briefly described in Section I-A 



the channel assignment problem corresponds to decentralized 
graph colouring. In the simplest model, the network is de- 
scribed by an undirected interference graph G — (V, E) with 
vertices V and edges E. Each vertex represents a WLAN 
and an edge exists between two WLANs if they interfere 
with each other when on the same channel. Define N = \V\ 
and M — \E\ and let T> denote the set of available colours 
(channels) and let x$ be a variable with value equal to 
the colour selected by an intelligence co-located with each 
vertex i G V. Each clause m G {1, . . . , \E\}, which is an 
enumeration of the elements of E, can be identified with an 
edge G E. We define 



1 if Xi ^ Xj 
otherwise. 



The participation set, Mi, of a variable xi consists of all 
clauses where the vertex i is one end of the associated edge, 

Mi = {m= : G E] . 



A variable assignment satisfies this CSP if and only if it is 
also a proper colouring for the graph G. Proper colourings 
correspond to channel assignments in which no two neigh- 
bouring WLANs in the interference graph have selected the 
same channel. To find a proper colouring, when one exists, 
a decentralized CSP solver requires only that an intelligence 
co-located with each vertex i can measure whether: (i) all of 
its neighbours are using a different colour from vertex i; or 
(ii) at least one neighbour has selected the same colour as 
vertex i. This information is sufficient to evaluate eq. |3]l and, 
in particular, the intelligence at each vertex does not need to 
know: (D3) the colour selected by any other vertex; (D4) its 
set of neighbours; or (D5) the exact nature of the clauses $ m , 
to G {1, . . . ,M}. 

2) Channel assignment with channel-dependent interfer- 
ence. Whether or not transmitters interfere in the channel 
assignment task may depend on the radio channel selected. 
This can arise due to frequency dependent radio propagation 
or to channel dependent spectral masks. That is, for regulatory 
reasons, different spectral masks are typically used when 
transmitting on channels at the edge and on channels in the 
middle of a radio band. This problem then has a collection 
of conflict graphs, one for each possible radio channel. This 
version of the channel assignment task can also be formulated 
as a CSP even though it is no longer a graph colouring 
problem. Let G(c) = (V,E(c)), c G T) be a set of undirected 
graphs with the same vertices V but possibly differing edge 
sets E{c). Again each vertex represents a WLAN, but the 
interference graph is channel-dependent. The Graph G(c) is 
associated with radio channel c and an edge exists in E(c) if 
two WLANs interfere while on channel c. Let Xj be a variable 
with value equal to the channel selected by each vertex i G V. 

Each clause to G {1, ... , \E(1)\ H h \E(D)\}, which is an 

enumeration of all edges in all graphs, can be identified with 
a c G V and an edge G E(c) and is defined by 



if Xi — Xj = c 

1 otherwise. 



The collection of clauses that variable Xj, associated with 
vertex i, participates in can be identified as 

Mi = [j {to ee (i,j,c) : G E(c)}. 

c£T> 

With Xi being the current channel selection of vertex i, for 
all c ^ Xi the clauses associated with to ee (i,j,c) are 
automatically satisfied. Thus to evaluate eq. Q it is sufficient 
for the station to measure if no neighbour coincides with 
its current channel selection or if at least one does. To find 
an interference-free channel assignment, a decentralized CSP 
solver requires that each vertex i can measure whether: (i) all 
of its neighbours on the currently selected channel are using 
a different channel from itself; or (ii) at least one neighbour 
has selected the same channel. 

3) Inter-session network coding. Network coding has been 
the subject of considerable interest in recent years as it offers 
the potential for significant increases in network capacity (9J, 
OH, ifTTl . In network coding, network elements combine 
packets together before transmission rather than forwarding 



4 



them unmodified. The combined packets can be from indi- 
vidual flows, known as intra-flow coding, or across multiple 
flows, inter-flow coding. 

While intra-flow coding within multicast flows has been 
well studied, inter-flow coding between unicast flows has 
received less attention, yet is, perhaps, of more immediate 
relevance to current Internet traffic. Inter-flow coding is known 
to be challenging lfl2ll . The task of a network finding a feasible 
linear network code in a distributed fashion, but with some 
global sharing of calculations, is investigated in lfl3ll through 
the use of a genetic algorithm. This task can be formulated as 
a CSP. 

Let G = (V,E) denote a directed acyclic multi-graph 
representing the network with vertices V and edges E. Each 
vertex represents either a source of an information flow, a 
destination or router in the network. Edges represent physical 
connections between these elements. Time is slotted and each 
edge can transmit a single packet per slot. We allow multiple 
edges between vertex pairs in order to accommodate higher 
rate links. It is the goal of each link beyond the source to 
determine a linear combination of its incoming packets to 
forward so that ultimately all flows get to their destinations. 
As the combination of packets coded by each link will take 
care of routing, each flow p E P is defined by its source, 
cr(p) € V, and its destination S(p) € V. Each flow is assumed 
to be unit rate with higher rate flows accommodated by flows 
with the same source and destination vertices. We will treat the 
collection of source, {J pe p{&(p)}, and destination vertices, 
{J pe p{5(p)}, as special, solely having one incoming and one 
outgoing edge per flow respectively. 

For each edge i E E, define s(i), s : E n- V, to be its 
source vertex and t(i), t : E H> V, to be its target vertex. 
For each vertex v E V we define the set of incoming edges 
I v := {i E E : t(i) = v} and the set of outgoing edges 
O v := {i E E : s(i) = v}. For each edge not corresponding to 
source, i 6 E\{J peP O a ( p ), we associate a variable Xi E T> := 
{l,...,2l p l}, which is an enumeration of the power set of the 
set of flows P. We define a bijective map ip : T> i-> {0, 1}' P ' 
that determines a vector whose positive entries correspond to 
flows to be coded. 

In total, we have \E \ Up^<r(p)l clauses. One for every 
link that isn't an outgoing link from a source vertex. At each 
time slot, each link that doesn't correspond to a source or 
final destination link, i € E \ V] p {O a ( p ) U Is( P )), wishes to 
forward a packet consisting of XORed packets (corresponding 
to addition in the Galois field of two elements) from the flows 
indicated by tp(xi). Each final link i g Up-^(p) wishes to 
forward packets from p, which we indicate by the vector -f p 
with a 1 at the location p and zeros elsewhere. A link and its 
immediate upstream neighbouring links will be dissatisfied if 
it cannot do so. 

More formally, for each edge not corresponding to the 
outgoing link from a source or incoming link to a destination, 
i *= ^\U P ep(0 ( T(p)U/ 5 (p)), we have a clause m = i, <& m (f), 
defined by 

\ if 39 s.t. *(i s(l) )e T =i}{ Xl ) T 
otherwise. 



where 9 is a binary vector and ^{I s u\) is the rectangular 
matrix consisting of ip(xj) T for j 6 \ UpepO^p) and 
and 7J if i g CT (p) f° r some p. 

To complete the CSP, we need to ensure that packets 
from flow p can be decoded at their destination 5(p). For 
i E UpeP ^<5(p)' tnen w ^ ^(p') ~ ^(*) we introduce a clause 



1 if3 0s.ttf(J, (i) ) 
otherwise. 



T 



where again 9 is a binary vector and ^ is is the rectangular 
matrix defined above. The set of clauses a link variable Xi, 

i € E \ (Up S p CT (p) U Ig(p)), participates in is 

Mi = {m = j : j = i or i E I s ^} 

Any variable assignment then satisfies this CSP if and only if 
it is a realizable network code satisfying all flow demands. 

Hence, to find a proper assignment a decentralized CSP 
solver requires only that each link i can determine whether: (i) 
its own coding is realizable and the coding of its immediately 
down-stream links are realizable; or (ii) if at least one of these 
is not realizable. This is sufficient to evaluate eq. (j3j. Each link 
does not need to explicitly know: (D3) the code selected by 
any other link; (D4) the network topology; (D5) any details 
of how code realizability is determined at any link. 

4) Decentralized transmission scheduling. In an Ethernet or 
WLAN it is necessary to schedule transmissions by stations. 
This might be achieved in a centralized manner using TDMA, 
but it can also be formulated as a decentralized problem. 
The classical CSMA/CA approach to decentralized scheduling 
never settles to a single schedule and some comes at a cost 
of the possibility of continual collisions. Recently, there has 
been interest in decentralized approaches for finding collision- 
free schedules, see [14], |[T5l . This task can be formulated as 
a CSP as follows. Let V denote the set of transmitters in the 
WLAN, T denote the set of available time slots and define 
N = \V\ and M = N(N - l)/2. Let x< be a variable with 
value equal to the transmission slot selected by transmitter 
i E V. Define a clause <J? m (x) associated with each pair of 
transmitters m = such that 



The participation sets are 



1 if Xi 7^ Xj 
otherwise. 



Mi = \J{r 



$ m (Xi,Xj:j e i,..) 



Any variable assignment satisfies this CSP if and only if it is 
also a collision-free time-slot schedule. To find a collision-free 
schedule, when one exists, a decentralized CSP solver requires 
only that each transmitter i can measure whether: (i) all of its 
neighbours are using a different time-slot from transmitter i; 
or (ii) at least one transmitter in the WLAN has selected the 
same time-slot as transmitter i. Again, this is all that is needed 
to evaluate eq. Q. Each transmitter does not need to know: 
(D3) the time-slot selected by any other transmitter; (D4) the 
set of transmitters; (D5) the clauses. 
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B. Related work - existing algorithms are not decentralized 

The literature on general purpose CSP solvers is vast, 
typically focusing on solving k-SAT problems in conjunctive 
normal form, but they can be broadly classified into those 
based on: (i) the Davis-Putnam-Logemann-Loveland (DPLL) 
algorithm 1161 . 02); (ii) Survey Propagation J8); and (iii) on 
Stochastic Local Search (SLS) Q. Each of these approaches 
has experienced substantial development and has its own 
merits, but none were motivated by problems where variables 
have a geographical sense of locality. 

Algorithms developed from the DPLL approach have proved 
to be the quickest at SAT-Race and SAT Competition in recent 
years, e.g. ManySAT 11X81 . The DPLL approach ultimately 
guarantees a complete search of the solution space and so 
meets the (Dl) and (D2) criteria. They are, however, based 
on a branching rule methodology, e.g. JT9|, that assumes the 
existence of a centralized intelligence that employs a back- 
tracking search. The implicit assumptions of the information 
available to this intelligence breaks the conditions (D3) (D4) 
(D5) and so they are not decentralized CSP solvers. 

Survey propagation, a development of belief propagation 
[20 1 from trees to general graphs, has proved effective in 
graphs that do not contain small loops 0211 . For a given CSP, 
the fundamental structure of study is a called a factor graph. 
In order to generate this, it is necessary to know what clauses 
each variable participates in and the nature of each of the 
clauses, breaking the (D4) and (D5) criteria and so these are 
not decentralized CSP solvers. 

SLS algorithms also depend fundamentally upon the ex- 
change of information, mostly in an explicit manner breaking 
the (D3) condition by basing update decisions on relative 
rankings of the constraint variables but also in a more sub- 
tle fashion. To see this implicit requirement, consider the 
following algorithm for binary valued variables originally 
proposed by Papadimitriou l22l and developed further by 
Schoning [23|. Pick a random assignment of values for the 
constraint variables. Repeat the following: from all of the 
unsatisfied clauses, pick one uniformly at random, select one 
of the variables participating in that clause and negate its 
value, breaking the (D4) and (D5) conditions. The algorithm 
halts when all clauses are satisfied or a specified time limit 
expires. Although simple, this forms the basic building block 
for all SLS algorithms, including the well-studied WalkSAT 
algorithm Q. It is important that a single unsatisfied clause is 
selected at each step and that a single variable within the clause 
is adjusted as it is this that leads to the algorithm behaving as 
a random walk [23|. Thus, again, solvers in this class are not 
decentralized. 

III. A DECENTRALIZED CSP SOLVER 

We now introduce an algorithm that satisfies the decentral- 
ized CSP solver criteria. 



Pi j over j G D as well as the current variable value Xi. In 
pseudocode, the CFL algorithm is: 



ALGORITHM 1: Communication-Free Learning 

1: Initialize pi j = 1/d, j G {1,...,D}. 
2: loop 

3: Realize a random variable, selecting Xi = j with probability 

Pi,]- 

4: Evaluate minmgA-i; <&m(x), returning satisfied if its value is 
1, as this indicates all of variable i's clauses are satisfied 
given the present assignment, and unsatisfied otherwise. 

5: Update: If satisfied, 

_ ( 1 if j = Xi 
Pi ' ] ~ \ otherwise. 

If unsatisfied, 

_ Ul-b)pi,j + a/(D -1 + a/b) if j = Xi 
Pi,j ~ 1 (1 - b)pij + b/(D - 1 + a/6) otherwise, 

where a,b G (0, 1] are design parameters. 
6: end loop 



To understand the logic behind CFL, note that for each 
variable a probability distribution over all possible variable- 
values is kept. The variable's value is then selected from this 
distribution. Should all the clauses that the variable participates 
in be satisfied with its current value, the associated probability 
distribution is updated to ensure that the variable value remains 
unchanged. If at least one clause is unsatisfied, then the 
probability distribution evolves by interpolating between it and 
a distribution that is uniform on all values apart from on the 
one that is presently generating dissatisfaction. Thus if all of 
a variable's clauses were once satisfied, the algorithm retains 
memory of this into the future. This has the effect that if a 
collection of variables are content, they can be resistant, but 
not impervious, to propagation of dissatisfaction from other 
variables. 

CFL possesses two parameters. The parameter b determines 
how quickly the past is forgotten, while a determines the 
algorithm's aversion to a variable value should it be found 
to cause dissatisfaction. Even though an instance of CFL is 
run for each variable, we shall show that this completely 
decentralized solution is a CSP solver. 

From here on we will assume that the update Step 5 is 
performed in a synchronized fashion across variables. Without 
synchronization, the fundamental character of the algorithm 
doesn't change, but the analysis becomes more involved. 
Synchronization solely requires that algorithm instances each 
have access to a shared sense of time and that this can be 
achieved without information-sharing or other communication 
between variables, i.e. between algorithm instances. A suitable 
clock is, for example, available to any Internet connected 
device via the Network Time Protocol (NTP). 



A. Communication- Free Learning Algorithm 

An instance of the following Communication-Free Learning 
(CFL) algorithm is run in parallel for every variable. For each 
variable, i G {1,2, ...,N} it keeps a probability distribution, 



B. CFL is a decentralized CSP solver 

By construction, the only information used by the algorithm 
is min me _A/f. in Step 4 and thus it satisfies the criteria 

(D3) , (D4) and (D5) . That is, it only needs to know if all 
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clauses in which variable i participates are satisfied or if one 
or more are not. The CFL algorithm also satisfies the (D2) 
criterion that it sticks with a solution from the first time one 
is found. To see this, note that the affect of Step 5 is that if a 
variable experiences success in all clauses that it participates in 
it continues to select the same value with probability 1, Thus 
if all variables are simultaneously satisfied in all clauses, i.e. 
if mini min m6 ^vi . = L then the same assignment will 

be reselected indefinitely with probability 1. 

To establish that the CFL algorithm is a decentralized CSP 
solver all that remains is to show that it meets the (Dl) 
criterion, that if the problem has a solution it will be found, 
which is dealt with by the the following theorem. It provides 
an upper bound on the distribution of the number of iterations 
the algorithm requires to find a solution to any solvable CSP. 
It's proof exploits the iterated function system structure of the 
algorithm and can be found in the Appendix. 

Theorem 1: For any satisfiable CSP, with probability 
greater than 1-e £ (0, 1) the number of iterations for the 
CFL algorithm to find a satisfying assignment is less than 



(N{N- 
P \2 



^log^We- 1 ), 



where 7 



min(a, b) 
D-l + a/b' 

For a CSP corresponding to graph coloring, a tighter bound 
holds and a satisfying assignment will be found with proba- 
bility greater than 1 — e in a number of iterations less than 

iVexp(27Vlog(7" 1 ))log(e~ 1 ). 

Theorem[T]proves that for fixed N the tail of the distribution 
of the number of iterations until the first identification of 
a satisfying assignment is bounded above by a geometric 
distribution and so all of its moments are finite. As TheoremQ] 
covers any arbitrary CSP that admits a solution, for any given 
instance these bounds are likely to be loose. They do, however, 
allow us to conclude the following corollary proving that if a 
solution exists, the CFL algorithm will almost surely find it. 

Corollary 2: For any CSP that admits a satisfying assign- 
ment, the CFL algorithm will find a satisfying assignment in 
almost surely finite time. 

Hence the CFL algorithm satisfies all of the criteria (Dl) 
(D2) (D3) (D4) (D5) , almost surely, and so is a decentralized 
CSP solver. 

C. Parameterization 

Theorem [T] establishes that the CFL algorithm provably 
identifies satisfying assignments for all values of its two design 
parameters, a and b. The value of a determines the algorithm's 
aversion to variable values for which clause failure has been 
experienced. The value of b impacts on the speed of conver- 
gence of the algorithm. Optimal values of a and b depend upon 
each problem and a performance metric. For fast convergence 
across a broad range of CSPs with distinct structure, we have 
found that small values of a and b, corresponding to strong 
aversion to a dissatisfying variable value and reasonably long 
memory, work well. Thus for simplicity, we set a = b in all 



experimental examples. We use b = 0.2 for random 3-SAT, 
b = 0.1 for random 4-SAT and b = 0.05 for random 5-SAT. 
We use b — 0.1 for our wireless networks example, a value 
which we have found to yield good performance across a range 
of fc-SAT problems. 

IV. Application-agnostic benchmarking 

A CSP associated with a specific networking task will 
possess structure induced by its topology. Before considering 
practically-motivated applications in Section [V] as the CFL 
algorithm can solve any CSP but its design was subject to 
restrictions not normally considered it is prudent to investigate 
its performance as a CSP solver in an application-agnostic 
setting. 

We present simulation data evaluating the performance of 
the CFL algorithm for random k-SAT, a well-studied class of 
CSPs. A k-SAT problem is a CSP in which all variables are 
binary-valued and clauses consist solely of logical disjunctions 
of no more than k variables. In random k-SAT, an instance of 
k-SAT is generated by drawing M such clauses uniformly at 
random 11221 . Our investigation surprisingly reveals that the 
CFL algorithm is competitive with centralized and distributed 
solvers on problems of reasonable size (order 1000 variables). 

Briefly, we first review current knowledge regarding random 
k-SAT. The behavior of random k-SAT is known to depend 
strongly on the parameter r = M/N with phase transitions and 



associated thresholds r- 



st,k 



and r P 



st,k 



If r > T- 



st.k 



the constraint problem is unsatisfiable with high probability, 
while if r < r ex i St .k a satisfying assignments exist with 
high probability. Evidently, r existyk < r^ exist>k and it is 
conjectured that r ex i St ,k — r^ e xist,k l25l . A simple argument 
gives r-, ex i St ,k < 2 fc log 2 and this upper bound can be refined 
to obtain the values in the second row of Table U Also shown 
in row four of this table are estimated values for r-, ex i St , k 
derived from statistical physics considerations. These latter 
estimates are supported by experimental data for k — 3, e.g. 
1 30], although there are fewer experimental studies for k > 3. 



It can be seen that the theoretical bound for r- 



st,k 



and the 



estimated values are in good agreement for k > 5, and that 
both approach 2 k log 2 for large k. Recent mathematical results 
have established that r existk > 2 k log 2— k [24 1 and this lower 
bound can be tightened to obtain the theoretical values shown 
in the third row of Table U 

There also exists a threshold r po iy :k below which a sat- 
isfying assignment can be found in polynomial time with 
high probability. This threshold has been the subject of much 
interest and the current best analytic lower bound for r po i Vtk 
is also indicated in Table [I] Statistical physics considerations 
have led to the conjecture that r po i Vtk is equal to the value 
r iRSB.k at which the one-step replica symmetry breaking 
(1RSB) instability occurs JSTJ. Current best estimates for this 
value are given in Table [I] For k > 8, it has been proven 
analytically that the set of satisfying assignments is grouped 
into widely separated clusters, which lends support to this 
conjecture [32]. For values of k < 8 the situation is less clear, 
with experimental evidence indicating that r po i ytk lies above 
fiRSB.k for k = 3,4 and 5 EH. 
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9.93 


21.12 


87.79 


708.91 
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4.15 


9.38 


19.16 


62.5 







TABLE I 

Current best upper/lower bounds on the phase transition thresholds in random k-SAT. 
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Fig. 2. Median normalized stopping time of CFL algorithm vs r for random 
3-SAT and N = 100. Each point is derived from at least 1000 runs of the 
algorithm; algorithm terminated after 10 7 iterations if no satisfying assign- 
ment found. For comparison, stopping time measurements for WalkSAT taken 
from 1 30 Fig 2a] are also indicated. (Inset) CFL algorithm measurements for 
random 4-SAT and 5-SAT. Measurements shown are for values of r up to 4.20 
(3-SAT), 9.90 (4-SAT), 21.10 (5-SAT); close to the current best estimates for 
r e xist,k and well above r 1RS B,k- 



o 


3 


-SAT/r=1 


• 


5 


-SAT/r=5 





3 


-SAT/r=2 


• 


5 


-SAT/r=10 


o 


3 


-SAT/r=3 


• 


5 


-SAT/r=15 


o 


3 


-SAT/r=4 









• • • • 

^ooo O O o 

° o O o 

° * 5 



o o o o o 



10 - 000 o S S • . . . . 



o o o o 



200 



o o o $ 

• • • 

o O o 

• • » 

° o o 

800 1000 



Fig. 3. Normalized stopping time of CFL algorithm vs N and r for 3-SAT 
and 5-SAT. Each point is derived from at least 1000 runs. 



We now consider the performance of the CFL algorithm. 
Fig. [2] gives median measurements of normalized stopping 
time (stopping time divided by N) of the CFL algorithm vs 
r=M/N when the number of variables N = 100. Fig. [3] gives 
median measurements of normalized stopping time vs N with 
r held constant. Data is shown for random k-SAT with k = 3, 4 
and 5. Observe two striking features from this data. Firstly, 
from Fig. [2] noting that a log scale is used on the y-axis, the 
median normalized stopping time increases exponentially with 
r when N is held constant . Secondly, from Fig. |3]the median 
normalized stopping time vs N, with r held constant, is upper 
bounded by a constant. That is, a satisfying assignment is 
found in a time with median value that increases no more than 
linearly with N. This linearity holds even when r is close to 
fexist,k (data is shown in Fig. [3] for 3-SAT with r = 4) and 
is of great practical importance as it implies that with high 
probability the CFL algorithm finds a satisfying assignment 
in polynomial time. 

Comparing the performance of the CFL algorithm with 
the popular WalkSAT algorithm, we note that the WalkSAT 
algorithm also exhibits an exponential-like dependence of 
stopping time on r and linearity in N ll30l . [34|, [33]. To 
allow comparison in more detail, median stopping time data 
taken from 11301 Fig 2a] is marked on Fig. [2] For r < 3.9, 
the median stopping time is similar for CFL and WalkSAT. 
Above this value, however, the stopping time for WalkSAT 
diverges, increasing super-exponentially in r. This divergence 
is a feature not only of WalkSAT but also of other local search 
algorithms algorithms e.g. ChainSat 1331 . It is not exhibited 
by the Survey Propagation algorithm l30l . JD, which has been 
the subject of considerable interest as it creates the ability to 
operate close to the r eX i s t,k threshold. Observe that the CFL 
algorithm also does not exhibit divergence as r approaches 
f e xist,k- These comparisons are encouraging as they indicate 
that the CFL algorithm is competitive with some of the most 
efficient general-purpose k-SAT algorithms currently available. 
It is also unexpected as information sharing is a key compo- 
nent of both WalkSAT and Survey Propagation, whereas CFL 
makes local decisions simultaneously with no use whatsoever 
of information-sharing, raising fundamental questions as to the 
role of message passing, the relationship between information 
exchange and algorithm performance and, in particular, what 
performance cost is imposed by constraining attention to 
decentralized operation. 
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V. Case study: channel allocation in 802.11 
WLANs 

Having established general properties of CFL and deter- 
mined its performance on random instances of k-SAT, we re- 
turn to a problem of the sort that motivated its introduction. We 
consider the performance of the CFL algorithm in a realistic 
wireless network case study. From the online database WIGLE 
ll35l we obtained the locations of WiFi wireless Access Points 
(APs) in an approximately 150m 2 area at the junction of 5th 
Avenue and 59th Street in Manhattan^] This space contains 
81 APs utilizing the IEEE 802.11 wireless standard. It can 
be seen from Figure [4] which plots the mean number of APs 
lying within distance d of an AP, that within a 15m radius an 
AP has on average 3 neighbours and within a 30m radius it 
has on average 10 neighbours. Of the 11 channels available 
in the 802.11 protocol, only 3 are orthogonal. Thus managing 
interference in this dense deployment is a challenging task. 

Imagine a worst-case scenario where after a power-outage 
all these APs are switched back on. The aim of each AP is to 
select its radio channel in such a way as to ensure that it is 
sufficiently different from nearby WLANs. This can be written 
as a CSP where we have N = 81 APs and N variables Xi 
corresponding to the channel of AP i, i = 1, 2, N. As per 
the 802.11 standard [36 1 and FCC regulations, each AP can 
select from one of 1 1 radio channels in the 2.4GHz band and 
so the Xi, i = 1,2, ,.,N take values in T> = {1,2,.., 11}. 
To avoid excessive interference each AP requires that: (a) 
no other AP within a 5m distance operates closer than 3 
channels away; (b) no AP within a 10m distance operates 
closer than 2 channels away and (c) no AP with a 30m distance 
operates on the same channel. We can realize this as a CSP 
with 3 constraint clauses per AP, giving 3N in total, $> m (x), 
m = l,...,3iV. With e(i,j) denoting the Euclidean distance 
between APs i and j in metres, for m — 1, N 



for m = N + 1 , 



1 if min.j :e (> n ,j)<5 \x m — Xj 
otherwise, 

.,2N 

1 if min J -. e ( m _ JV j)<io \x m -N " 
otherwise, 



> 3 



and for m = 2N + 1,...,3N 



1 if min J:e ( m _ 2A r J ) <30 \x m -2N 
otherwise, 



> 2 



> 1 



The attenuation between adjacent channels is — 28dB 
Taking the radio path loss with distance as d a , where d is 
distance in meters and a = 4 the path loss exponent, then 
these constraints ensure > 60 dB attenuation between APs. 

Assuming all APs use the maximum transmit power of 
18dBm allowed by the 802.11 standard, this means that the 
SINR is greater than 20dB within a 10m radius of each AP 
which is sufficient to sustain a data rate of 54Mbps when the 



'The extracted (x,y,z) coordinate data used is available online at 
hamilton.ie/net/xyz.txt 
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Fig. 4. Mean number of APs within distance d of an AP vs distance d. 



60- 




50- 




40- 




30- 






> 






20 












10 










* • 



- B - «^ 



120 100 80 



40 20 
x (m) 




y (m) 



Fig. 5. A satisfying assignment of radio channels. Each dot marks the location 
of a WiFi wireless access point and is based on measurements taken at the 
junction of 5th Avenue/59th Street in Manhattan. The color of a dot indicates 
the radio channel. To avoid interference between transmissions, nearby access 
points need to operate on radio channels that are spaced sufficiently far apart. 
There are 1 1 radio channels available to choose from, but the frequencies of 
these channels overlap so that only 3 orthogonal/non-overlapping channels 
are available; there are 81 wireless access points in total. 



connection is line of sight and channel noise is Gaussian 

02. 

Observe that each variable x^ is located at a different AP. 
The APs do not belong to a single administrative domain 
and so security measures such as firewalls prevent commu- 
nication over the wired network. An AP cannot rely on 
decoding wireless transmissions to communicate with all of 
its interferers or even just to identify them. This is because 
interferers may be too far away to allow their transmissions 
to be decoded and yet still their aggregate transmission power 
may be sufficiently powerful to create significant interference. 
That is, an AP cannot know the channel selections of other 
APs, condition (D3) and, moreover, cannot reliably identify or 
enumerate the clauses in which it participates, condition (D4) 
. A decentralized algorithm is therefore mandated. 

Fig. [5] shows an example satisfying assignment of radio 
channels obtained by running an instance of the CFL algorithm 
at all APs. The complexity of the topology generated by the 
physical location of the APs and the non-uniformity of the 
clauses it causes is apparent. 

Fig. [6] shows the measured distribution of number of itera- 
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Fig. 6. Log of the empirical complementary cumulative distribution of 
convergence time in iterations, based on 12,000 runs of CFL algorithm for 
5th Avenue data. The median is 21 and the 95% percentile is 98 iterations. In 
current hardware one iteration can readily be performed in under 10 seconds, 
leading to a median time to convergence of less than 4 minutes. 

tions required to find a satisfying assignment, whereupon the 
algorithm natural halts in a decentralized fashion. The median 
value is 21 iterations and the 95% centile is 98 iterations. Note 
that during this convergence period, although the network is 
operating sub-optimally, it does not cease to function. In a 
prototype lab set-up we have shown that a CFL update interval 
of less than 10 seconds is feasible on current hardware. Thus 
the median time to convergence is under 4 minutes. This is 
a reasonable time-frame for practical purposes, particularly as 
subparts of the network are functioning during this conver- 
gence period, and thus the CFL algorithm offers a pragmatic 
solution to this difficult decentralized CSP for which existing 
solvers could not be employed. 

VI. Conclusions 

We have shown that apparently distinct problems in net- 
working can be placed within the framework of CSPs, where - 
unlike with traditional motivating examples - the variables are 
co-located with devices that may not be able to communicate. 
We define the criteria that practical solvers must possess 
in order to be suitable for theses problems, labeling them 
decentralized CSP solvers. 

As existing solvers fail to meet one or more of these criteria, 
we introduce a decentralized algorithm for solving CSPs. We 
prove that it will almost surely find a satisfying solution if 
one exists. In doing so, we generate bounds for the speed of 
convergence of the algorithm. We suspect, however, that our 
bounds for a general CSP are not tight and conjecture that the 
real bound should be closer to the one we have for the specific 
case of CSPs corresponding to graph coloring. 

Given how much information decentralization is sacrificing, 
surprisingly an experimental investigation of solving random 
k-SAT instances suggests that the algorithm is competitive 
with two of the most promising centralized k-SAT solvers, 
WalkSAT and Survey Propagation, on instances of random k- 
SAT with order one thousand variables. This raises the ques- 
tion: what is the performance cost of decentralized operation? 
This is particularly pertinent as the decentralized nature of the 



CFL algorithm lends itself to parallelized computation as it 
has a small, fixed memory requirement per variable, making it 
suitable for use in circumstances where a centralized algorithm 
could also be used but would be difficult to implement in a 
distributed fashion. 

Appendix 

For each i £ {l,...,iV} let $(i) £ [0, l] 23 , t £ N, be 
the CFL probability vector for variable i at time t and Xj(i) 
be the variable's value selected stochastically from Pi(t). Let 
P{t) = (fi(t),...,? N (t)) and X(t) = {xi{t),...,x N {t)) 
record the overall state of the CSP-CFL system. 

The state of the probability vectors {P(t)} forms a Markov 
Chain, or Iterated Function System, with place dependent 
probabilities l38l . The convergence time of the algorithm is 
the first time the chain enters an absorbing state representing a 
valid solution to the CSP. The absorbing states are identifiable 
in terms of the variable values {X(t)}: 

A = \j{x: $ m (x) = 1 for all m £ M} , 

as if X(t) £ A, then P(t + 1) = (S Xl ,...,5 XN )so that X(t + 
1) = X(t) almost surely and hence a solution to the CSP has 
been found. The algorithm's stopping time is 

r := inf \X{t) £ A}. 

t>Q 

Define 7 = min(a, b)/(D — 1 + a/b) and let S be the set 
of states such that for all i £ J\f either p. t = 5k for some 
k £ {1, ■■■,£>} or (pi) k > 7 for all k £ {1, ■ • • , M}. The 
next lemma is obvious. 

Lemma 1: For any integer t > 0, P(t) £ S. 

Theorem [7J The method of proof for both statements is 
similar: we create a sequence of events over N — 1 iterations 
that, regardless of the initial configuration, lead to a satisfying 
assignment with a probability that we find a lower bound 
for. Due to the Markovian nature of the algorithm and the 
independence of the probability of this event on its initial 
conditions, if this event does not occur in N — 1 iterations, 
it has the same probability of occurring in the next N — 1 
iterations. This is what leads to the geometric nature of the 
bounds. The difference between a general CSP and those 
corresponding to coloring is that for the latter the variables 
are ensured to experience fewer dissatisfaction events before 
finding a satisfying assignment. 

First consider a general CSP. Select an arbitrary valid 
solution a £ A with components (a)i. For each m £ A4 
define AT m to be the variables that participate in clause to. 
For each t > 0, define the set of unsatisfied variables at time 
t by 

U t := (J {M m : $ ro pT(t)) = 0} 

and the number of unsatisfied variables to be = |f7 t |. At 
time assume X(0) — x(0), some x(0). If Uq — then 
X(Q) £ A, the algorithm has found a solution and r = 0. If 
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Uo 7^ 0, the algorithm has not yet found a solution. Define the 
components, x(l)i, of the vector x{\) by 



if i £ U Q 
if i<£U 



and, by Lemma [T] 

P(X(1) 



x(l)\X(0) = x(0)) > Y 



If Ui = then X(l) £ A, the algorithm has found a solution 
and t = 1. If Ui 0, the algorithm has not yet found a 
solution and we proceed in an iterative fashion. For t > 0, 
define the components, x(t + of the vector x(t + 1) by 



x(t + 1), = 



3i if i G C/f 
if i ^ J7j 

and, by Lemma [T] 

P(X(t + 1) = £(i + = j?(t)) > 7 nt . 

Let t* = inf{£ : n t = 0} and note that 

P{X(t*) £ A) > 

P{X(t*) = 2(f), X(0) = x (0)) > 7 ^*=» ™* . 

We wish, therefore, to place an upper bound on the sum in 
the exponent. Observe that t* < N — 1, as one starts with at 
least one unsatisfied variable at each stage we me must include 
at least one new variable and, therefore, this procedure must 
terminate in no more than N— 1 steps with a valid assignment. 

In general, the {nl } sequence that maximizes this sum, even 
though this sequence may not be feasible for a given CSP, is 



1, 71, 



This occurs if we start with one unsatisfied variable and, in 
changing to the valid solution, at each iteration one additional 
variable has cause to be unsatisfied. On taking its new value, 
the unsatisfied variable included at time t triggers the failure 
of t — 1 clauses each of which contains 3 variables: itself, the 
variable included immediately previously and one of the other 
variables that has already experienced a failure. This gives 



t* 

£' 

t=0 



N 

E h * 

t=0 



N(N + 1) 



and therefore 



p(x(n - 1) g a) > y^+D/ 2 . 

For a CSP corresponding to graph coloring with D colours, 
the advantage is that variables cannot be dissatisfied indefi- 
nitely by newly dissatisfied variables. Instead this procedure 
generates a flame-front of dissatisfied variables where those 
variables sufficiently far within the interior of this flame- 
front can select their final value without further disturbance. 
For graph colouring, clauses occur only for each pair of 
neighbors, i and j, such that $(2) = if (x)i = (x)j and 
$(2) = 1 otherwise. For any initial configuration 2(0) with 
dissatisfaction, we must have n > 2. In a sequence of events 
analogous to those in the proof of [6 1 [Theorem 3], these two 
variables select their final values, Si and (a)j, at the next 



round, causing their initial clause to be satisfied as a £ A. If 
either of the variables taking its final value causes no further 
clause to be dissatisfied, then in this sequence of events it will 
never appear in a dissatisfied clause again and stays with its 
value with probability 1. If it does cause one or more clause 
to be triggered, once these are satisfied it will experience no 
more dissatisfaction. Thus each variable experiences at most 
2 rounds of dissatisfaction so that for graph colouring: 



AT-l 



E^E 



= 2N. 



t=o 



t=0 



As an example that illustrates the difference between a 
general CSP and graph colouring, consider an instance of 3- 
SAT starting with one unsatisfied variable, x\. The following 
sequence of clauses is possible in general, but not with graph 
colouring. Starting with an unsatisfied x\, changing its value 
to satisfy the first clause causes x-i to be dissatisfied, which 
in turn causes x\ and X2 and X3 to be dissatisfied: 

X\, ->x\ V X2 and ->xi V ^X2 V X3. 

Satisfying these clauses for x\, X2 and £3 causes two more 
clauses with X4 to trigger, keeping all variables unsatisfied: 

-1X1 V -1X2 V X4 and ->xi V ^2:3 V 24. 

Satisfying these two clauses causes three further clauses in- 
volving £5 to become unsatisfied: 

-1X1 V ^X2 V £5, -1X1 V ^X3 V £5 and ->xi V -^x^ V x$ 

and so forth triggering clauses with three variables at a time 
such that all variables are unsatisfied until a solution is found. 
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